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1. A. method for providing at least one self-tuning object 
to a i\ser program, comprising: 

receiving said user program; 
simulating execution of said user program; 
detecting, during said simulating of said execution of 
said user program, a plurality of expressions including said 
self-tuning ob\ect in said user program; 

generatingX in response to said detecting said 
plurality of expressions including said self -tuning object 
in said user program, a trace file indicating a sequence of 
said expressions including said self-tuning object in said 
user program; 

dividing said trace\file into a plurality of trace file 
blocks; 

converting said trace\ file blocks into source code 
expression blocks; 

generating a plurality 6d: minimal timing, compiled 
expression blocks, each of said plurality of minimal timing, 
compiled expression blocks corresponding to a respective one 
of said source code expression blocks, said generating said 
plurality of minimal timing, compiled expression blocks 
including application of at least one compiler optimization 
technique ; and 

linking said plurality of minimal \iming, compiled 
expression blocks into said user program. 



2. The method of claim 1, wherein said detecting said 
30 plurality of expressions including said self-tunijig object 
in said user program is performed by program code associated 
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w-ith at least one overloaded operator associated with said 
\ 

self -tuning object. 

3. The\method of claim 1, wherein said generating a trace 
file indicating the sequence of said expressions including 
said self-tuning object in said user program is performed by 
program code associated with at least one overloaded 
operator associatecKwith said self-tuning object. 

4 . The method of claim :l, wherein said dividing said trace 
file into said plurality cxf trace file blocks is performed 
such that a total amount of\computational dependencies and 
synchronization requirements Nwithin said user program, 
including computational dependencies and synchronization 
requirements between trace file blocks, are minimized. 

5. The method of claim 1, wherein said\dividing said trace 
file into said plurality of trace file blocks is performed 
responsive to user provided delimiters included within said 
user program. \^ 

6. The method of claim 1, wherein said generating said 
plurality of minimal uiming, compiled expression blocks 
further comprises compiling and executing at least one of 
said expression blocks mulViple times while varying a value 
of at least one optimizationrpar^fneter for said at least one 
compiler optimization technirqUe. 

7. The method of claim 6, wherein said generating said 
plurality of minimal timing, vcompiled expression blocks 
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further comprise ^Osim^Lng said multiple executions of said 
compiled expression\blocks . 

8. \he method of claim 1, wherein said linking of said 
minima\L timing, compiled expression blocks to said user 
program\is responsive to execution of said user program. 

9. The method of claim 8, wherein said linking of said 
minimal timing, compiled expression blocks further comprises 
detecting, during said execution of said user program, said 
plurality of \ expressions including said self-tuning object 
in said user program. 

10. The method \of claim 9, wherein said linking of said 
minimal timing, compiled expression blocks further comprises 
scheduling said minimal timing, compiled expression blocks 
for execution on at\least one processor of a target parallel 
processing computer . 

11. A computer programXproduct including a computer 
readable medium, said computer readable medium having at 
least one computer programs stored thereon, said at least one 
computer program comprisingN 

program code for receiving said user program; 
program code for simulating execution of said user 
program; 

program code for detectingX during said simulating of 
said execution of said user {program, a plurality of 
expressions including said self-tuning object in said user 
program; 
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) program code for generating, in response to said 
detecting said plurality of expressions including said self- 
tuning \object in said user program, a trace file indicating 
a sequence of said expressions including said self-tuning 
5 object in said user program; 

program code for dividing said trace file into a 
plurality o^ trace file blocks; 

programVode for converting said trace file blocks into 
source code expression blocks; 
10 program code for generating a plurality of minimal 

timing, compiled\ expression blocks, each of said plurality 
of minimal timingX compiled expression blocks corresponding 
to a respective one, of said source code expression blocks, 
said generating said\ plurality of minimal timing, compiled 
15 expression blocks inoluding application of at least one 
compiler optimization technique; and 

program code for Tanking said plurality of minimal 
timing, compiled expression, blocks into said user program. 

20 12. The computer program product of claim 11, wherein said 
program code for detecting skid plurality of expressions 
including said self-tuning object in said user program 
comprises program code associated with at least one 
overloaded operator associated witn\said self-tuning object. 

25 \ 

13. The computer program product of Vlaim 11, wherein said 
program code for generating a trace \ file indicating the 
sequence of said expressions including said self-tuning 
object in said user program comprises program code 

30 associated with at least one overloaded operator associated 
with said self-tuning object. \ 
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14. \ The computer program product of claim 11, wherein said 
program code for dividing said trace file into said 
pluraliTw of trace file blocks is operative to divide said 
trace file\into said plurality of trace file blocks such 
that a totals, amount of computational dependencies and 
synchronization ^acquirements within said user program, 
including computational dependencies and synchronization 
requirements between trc^ce file blocks, are minimized. 

15. The computer program product of claim 11, wherein said 
program code for dividing >said trace file into said 
plurality of trace file blocks re operative to divide said 
trace file into said plurality\ of trace file blocks 
responsive to user provided delimiters included within said 
user program. \ 

16. The computer program product of claim 11, wherein said 
program code for generating said plurality of minimal 
timing, compiled expression blocks further comprises program 
code for compiling and\ executing at least one of said 
expression blocks multiples times while varying a value of at 
least one optimization parameter for said at least one 
compiler optimization tecmnwu^: 

17. The computer program product of claim 16, wherein said 
program code for generating >said plurality of minimal 
timing, compiled expression blockite further comprises program 
code for timing said multiple executions of said compiled 
expression blocks. N 
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18-^The computer program product of claim 11, wherein said 

progi^am code for linking of said minimal timing, compiled 

expression blocks to said user program is triggered by 
execution of said user program. 



19. The computer program product of claim 18, wherein said 
linking of\said minimal timing, compiled expression blocks 
further comprises program code for detecting, during said 
execution of\ said user program, said plurality of 
expressions including- said self-tuning object in said user 
program. 

20. The computer program product of claim 19, wherein said 
program code for linking of said minimal timing, compiled 
expression blocks farther comprises program code for 
scheduling said minimal timing, compiled expression blocks 
for execution on at leas\ one processor of a target parallel 
processing computer. 

21. The computer program product of claim 11, wherein said 
computer program comprises a (Compiler, 



22. A computer data signal embodied in a carrier wave, said 
computer data signal including at \east one computer 
program, said at least one computer program comprising: 

program code for receiving said wser program; 

program code for simulating execution of said user 
program; 

program code for detecting, during \said simulating of 
said execution of said user program, \ a plurality of 
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^cpressions including said self-tuning object in said user 
program; 

\ program code for generating, in response to said 
detecting said plurality of expressions including said self- 
tuning\pbject in said user program, a trace file indicating 
a sequence of said expressions including said self-tuning 
object in said user program; 

program\ code for dividing said trace file into a 
plurality of t\ace file blocks; 

program coofe for converting said trace file blocks into 
source code expression blocks; 

program code Vor generating a plurality of minimal 
timing, compiled expression blocks, each of said plurality 
of minimal timing, compiled expression blocks corresponding 
to a respective one of \said source code expression blocks, 
said generating said plurality of minimal timing, compiled 
expression blocks including application of at least one 
compiler optimization technique; and 

program code for linkirrg said plurality of minimal 
timing, compiled expression blooks into said user program. 

23. A system for providing at leaVt one self-tuning object 
to a user program, comprising: \ 
at least one processor; \ 

at least one memory communicably ooupled to said at 
least one processor; \ 

a computer program for execution on 3aid processor, 
said computer program stored in said memory\ said computer 
program comprising: \ 

program code for receiving said useis program; 
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/ program code for simulating execution of said user 
program; 



program code for detecting, during said simulating 
of said execution of said user program, a plurality of 
expressions including said self-tuning object in said 
user program; 

program code for generating, in response to said 
detecting saSid plurality of expressions including said 
self-tuning oVject in said user program, a trace file 
indicating a sequence of said expressions including 
said self-tuning\pbject in said user program; 

program codeVfor dividing said trace file into a 
plurality of trace ^file blocks; 

program code foV converting said trace file blocks 
into source code expression blocks; 

program code for generating a plurality of minimal 
timing, compiled expression blocks, each of said 
plurality of minimal timing, compiled expression blocks 
corresponding to a respective one of said source code 
expression blocks, said generating said plurality of 
minimal timing, compiled expression blocks including 
application of at least one^ compiler optimization 
technique; and \ 

program code for linking sard plurality of minimal 
timing, compiled expression blocks into said user 
program. \ 

24. A system for providing at least one s>elf-tuning object 

to a user program, comprising: \ 
means for receiving said user program; \ 
means for simulating execution of said user program; 
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means for detecting, during said simulating of said 
execution of said user program, a plurality of expressions 
including said self-tuning object in said user program; 

means^ for generating, in response to said detecting 
5 said plurality of expressions including said self-tuning 
object in said user program, a trace file indicating a 
sequence of saibL expressions including said self-tuning 
object in said user program; 

means for dividin^\said trace file into a plurality of 
10 trace file blocks; 

means for converting sait| trace file blocks into source 
code expression blocks; 

means for generating a plOKrality of minimal timing, 
compiled expression blocks, eacny of said plurality of 
15 minimal timing, compiled expression xjlocks corresponding to 
a respective one of said source code expression blocks, said 
generating said plurality of minimaJ\ timing, compiled 
expression blocks including application \}f at least one 
compiler optimization technique; and 
20 means for linking said plurality of m\nimal timing, 

compiled expression blocks into said user progrc 
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